The South African directory enquiries (SADE) name corpus

نویسندگان
چکیده

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The NCHLT speech corpus of the South African languages

The NCHLT speech corpus contains wide-band speech from approximately 200 speakers per language, in each of the eleven official languages of South Africa. We describe the design and development processes that were undertaken in order to develop the corpus, and report on associated materials such as orthographic transcriptions and pronunciation dictionaries that were released as part of the corpu...

متن کامل

Corpus-based Name Standardization

Variation in the spelling of names has various origins, many of which many are difficult to describe by rule. We present a method that uses both rules and a similarity measure of a probabilistic nature, and which can make use of existing onomastic corpora. Rules first convert an unknown name to a semiphonemic form. Then a selection is made of possible candidates in the onomastic corpus. For thi...

متن کامل

The Effect of South African Geopolitical Position in the Development of Cinema in South Africa

This paper will discuss about the role of the geopolitical location of South Africa in the development of movie and the cinema industry in this country. Despite of bringing ci-nema to South Africa by white Europeans, but the development of this phenomenon is mostly due to the geopolitical position of this country. It is interesting to know that ci-nema reached Africa in much the same time as it...

متن کامل

Corpus-Based Pinyin Name Resolution

For readers of English text who know some Chinese, Pinyin codes that spell out Chinese names are often ambiguous as to their original Chinese character representations if the names are new or not well known. For English-Chinese cross language retrieval, failure to accurately translate Pinyin names in a query to Chinese characters can lead to dismal retrieval effectiveness. This paper presents a...

متن کامل

Predicting name pronunciation for a reverse directory service

Text-to-speech systems are nonnally not suited for name pronunciation in, for example, an automatized reverse directory service. A new project has been started to alleviate these problems. First names, sumames and street names corpora from the Greater Stockholm telephone directory has been studied. The sumames corpus has been tagged as to langnage origin. Both rule based methods and an artifici...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Language Resources and Evaluation

سال: 2019

ISSN: 1574-020X,1574-0218

DOI: 10.1007/s10579-019-09448-6